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Abstract 

The  transcriptional  repertoire  of  the  in  vivo  liver  stage  of  Plasmodium  has  remained  largely  unidentified  and  seemingly  not  amenable 
to  traditional  molecular  analysis  because  of  the  small  number  of  parasites  and  large  number  of  uninfected  hepatocytes.  We  have  overcome 
this  obstruction  by  utilizing  laser  capture  microdissection  to  provide  a  high  quality  source  of  parasite  mRNA  for  the  construction  of  a  liver 
stage  cDNA  library.  Sequencing  and  annotation  of  this  library  demonstrated  expression  of  623  different  Plasmodium  yoelii  genes  during 
development  in  the  hepatocyte.  Of  these  genes,  25%  appear  to  be  unique  to  the  liver  stage.  This  is  the  first  comprehensive  analysis  of  in  vivo 
gene  expression  undertaken  for  the  liver  stage  of  P.  yoelii ,  and  provides  insights  into  the  differential  expression  of  P.  yoelii  genes  during  this 
critical  stage  of  development. 

©  2005  Elsevier  B.V.  All  rights  reserved. 
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1.  Introduction 

Parasites  belonging  to  the  genus  Plasmodium  have  a  com¬ 
plex  lifecycle  that  requires  the  stage  specific  expression  of 
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different  proteins  to  complete  development  in  both  the  in¬ 
sect  vector  and  vertebrate  host.  It  has  been  intuitively  and 
empirically  understood  that  the  parasites  express  different 
sets  of  proteins  during  development  and  that  these  proteins 
can  be  shared  or  unique  to  specific  stages.  There  are  nu¬ 
merous  examples  of  the  stage  specific  expression  of  parasite 
genes  at  the  sporozoite  and  blood  stage  [1-5],  but  experimen¬ 
tal  evidence  identifying  the  breadth  of  this  regulated  gene 
expression  has,  until  recently,  been  lacking.  However,  re¬ 
cent  work  has  shown  that  cDNA  libraries  can  be  constructed 
to  generate  expressed  sequence  tags  (ESTs)  or  expression 
profiling  can  be  done  by  high-density  oligonucleotide  array 
analysis.  These  methodologies  can  provide  significant  infor¬ 
mation  about  gene  expression  [6,7],  with  the  EST  library 
results  having  been  used  to  define  the  transcriptome  of  the 
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parasites  during  their  development  [8].  The  libraries  have 
been  quite  useful,  but  it  has  been  the  completed  shotgun  se¬ 
quencing  of  several  Plasmodium  genomes  [9,10],  which  has 
truly  allowed  the  effective  utilization  of  these  sequence  li¬ 
braries.  The  resulting  database  has  become  available  and  can 
be  effectively  used  for  gene  discovery  and  the  identification 
of  gene  expression  from  the  sporozoite  and  blood  stage. 

The  utilization  of  the  Plasmodium  yoelii  rodent  model 
for  malaria  has  facilitated  the  identification  of  stage  spe¬ 
cific  gene  expression  and  allowed  the  identification  of  many 
parasite  genes  [11-14],  A  significant  amount  of  data  now 
exists  to  support  the  studies  of  gene  expression  during  the 
sporozoite  and  blood  stages  of  the  parasites’  development. 
Unfortunately,  gene  expression  analysis  during  in  vivo  liver 
stage  development  has  not  been  amenable  to  the  types  of 
molecular  investigations  that  would  serve  to  identify  differ¬ 
entially  expressed  genes  [15].  One  of  the  major  impediments 
in  identifying  parasite  gene  expression  at  the  liver  stage  has 
remained  the  large  number  of  uninfected  hepatocytes  relative 
to  the  number  of  infected  hepatocytes  in  the  liver  after  sporo¬ 
zoite  inoculation.  Isolation  of  RNA  from  an  infected  liver 
results  in  an  abundance  of  host  RNA  and  a  minor,  though 
detectable  [16]  amount  of  parasite  message.  Current  work 
by  one  group,  to  get  around  this  problem,  has  utilized  an  in 
vitro  culture  method  to  transform  sporozoites  into  early  exo- 
erythrocytic  forms  (EEFs)  and  subsequently  make  an  EST 
library  from  isolated  parasite  RNA  [17].  We  chose  an  ap¬ 
proach  that  allowed  us  to  look  at  the  in  vivo  expression  of 
parasite  mRNA  during  liver  stage  development.  Recent  in¬ 
novations  in  microdissection  have  dramatically  improved  our 
abilities  to  separate  schizonts  from  hepatocytes.  Work  from 
our  laboratory  demonstrated  that  laser  capture  microdissec¬ 
tion  (LCM)  could  be  used  to  recover  liver  stage  parasites 
from  P.  yoelii  infected  mouse  liver  and  that  the  recovered 
schizonts  were  virtually  free  of  contaminating  host  tissue 
[18].  These  results  clearly  presented  an  opportunity  to  con¬ 
struct  a  cDNA  library,  from  isolated  schizonts  that  would  be 
almost  exclusively  derived  from  parasite  mRNA  and  thus 
avoid  the  contamination  of  the  library  with  host  derived 
sequences. 

We  now  report  the  generation,  sequencing  and  annota¬ 
tion  of  a  P.  yoelii  liver  stage  EST  library  that  has  sup¬ 
plied  significant  information  about  the  in  vivo  expression 
of  P.  yoelii  genes  during  this  critical  stage  of  the  para¬ 
site’s  lifecycle.  This  is  the  first  analysis  of  gene  expres¬ 
sion  during  in  vivo  liver  stage  development  and  as  such 
has  allowed  the  initial  identification  of  both  constitutively 
and  developmentally  regulated  gene  transcripts  as  they  ap¬ 
pear  in  the  infected  hepatocyte.  This  work  complements  and 
extends  the  previous  studies  done  to  identify  the  transcrip- 
tome  and  proteome  expressed  during  other  stages  of  the 
parasites’  development.  Taken  together,  the  analysis  of  P. 
yoelii  liver  stage  gene  expression  offers  a  unique  view  of 
the  biology  of  plasmodial  parasites  and  the  possibility  of 
identifying  regulated  gene  function  throughout  the  parasite’s 
lifecycle. 


2.  Materials  and  methods 

2.1.  Parasite  preparation 

A  B ALB/c  mouse  was  infected  with  5  x  106  of  P.  yoelii 
17  x  NL  sporozoites.  The  Infected  liver  was  harvested  40 h 
later  and  processed  for  cryosectioning  as  previously  reported 
[18]. 

2.2.  Laser  capture  microdissection  and  RNA  extraction 

Laser  capture  microdissection  of  liver  stage  schizonts 
was  accomplished  using  a  Pixcell  laser  capture  microscope 
equipped  with  an  infrared  diode  laser  (Arcturus  Engineering, 
Santa  Clara,  CA)  as  previously  reported  [18], 

Briefly,  tissue  sections  on  glass  slides,  were  fixed  in 
70%  ethanol,  stained  with  hematoxylin  and  eosin,  dehy¬ 
drated  and  air-dried.  RNAse  inhibitor  (SUPERase  In™, 
Ambion,  Austin,  TX)  was  included  in  all  aqueous  staining 
and  washing  steps  to  limit  the  possibility  of  RNA  degra¬ 
dation.  Each  dehydrated  tissue  section  was  placed  on  the 
microscope  stage  and  a  transfer  cap  placed  on  the  tissue. 
Individual  schizonts  were  captured  onto  the  thermoplastic 
membrane,  mounted  onto  the  transfer  cap,  by  melting  of 
the  membrane  via  laser  activation.  Typically,  the  laser  spot 
size  was  7.5  p,m,  the  power  40  mW  and  the  pulse  duration 
3  ms. 

Total  RNA,  from  cohorts  of  caps  containing  1500  laser 
microdissected  schizonts,  was  isolated  using  the  StrataPrep 
Total  RNA  Microprep  Kit  from  Stratagene  (LaJolla,  Califor¬ 
nia)  following  the  manufactures’  directions. 

2.3.  cDNA  library  construction  and  sequencing 

Total  RNA,  from  LCM  recovered  parasites,  was  Dnase  I 
treated  to  remove  possible  genomic  DNA  contamination.  To¬ 
tal  RNA  was  transcribed  into  cDNA  and  amplified  using  the 
SMART™  cDNA  Synthesis  kit  (Clontech,  Palo  Alto,  CA). 
After  amplification  for  18  cycles,  cDNA  synthesis  was  eval¬ 
uated  by  running  on  an  agarose  gel  and  then  digested  with 
Sfi  I  and  size  fractionated  on  a  CHROMA  SPIN  column. 
Only  fractions  showing  cDNA  larger  than  300  bp  were  pooled 
and  ligated  into  Sfi  I-digested  \TriplEx2  Vector  (Clontech, 
Palo  Alto,  CA).  The  cDNA  library  was  packaged  using  the 
Gigapack  III  Gold  Packaging  extract  (Stratagene,  La  Jolla, 
CA).  Single  phage  plaques  were  picked  and  converted  into 
pTriplEx2  plasmid  clones  using  BM25.8  host  cells  and  a 
protocol  recommended  by  Clontech.  A  preliminary  evalu¬ 
ation  of  96  clones  indicated  that  the  cDNA  library  contained 
approximately  80%  recombinant  clones.  DNA  was  isolated 
from  2628  of  clones  using  a  mini-prep  kit  from  Millipore 
(Bedford,  MA)  and  sequenced  using  both  pTriplEx2-specific 
forward  and  reverse  primers  and  the  BigDye  termina¬ 
tor  method  on  a  3100  Genetic  Analyzer  (ABI,  Foster 
City,  CA). 
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2.4.  Assemblies  and  database  searches 

Sequences  were  trimmed  from  their  vector  and  primer 
contaminants  using  a  program  written  in  Visual  Basic,  as 
described  before  [19].  Only  those  sequences  having  >99 
nucleotides  (nt)  and  less  than  10%  N  (undetermined  nu¬ 
cleotide  base)  were  used.  Trimmed  sequences  were  clustered 
by  first  comparing  the  set  against  itself  using  BlastN  [20] 
without  filters  (FF  options),  and  clustering  the  sequences 
presenting  at  least  90%  identity  over  a  100  nucleotide  (nt) 
length.  These  clusters  were  further  assembled  with  the  CAP 
program  [21],  which  could  generate  more  than  one  CAP 
cluster  per  Blast  cluster.  The  assembled  sequence  data  was 
further  compared  to  several  other  databases,  using  Blastx, 
BlastN  or  rpsBlast,  as  follows.  The  non-redundant  protein 
database  set  of  the  National  Center  for  Biotechnology  Infor¬ 
mation  (NCBI),  a  group  of  human  and  Drosophila  protein 
sequences  annotated  by  the  Gene  Ontology  consortium  [22], 
the  Conserved  Domains  Database  of  the  NCBI  (http://www. 
ncbi.nlm.nih. gov/Sitemap/index. html#CDD)  containing  the 
PFAM  [23],  SMART  [24],  and  COGs  [25]  alignments 
of  protein  motifs,  three  subsets  of  the  NCBI  nucleotide 
databases  containing  all  mitochondrial,  rRNA  and  mouse 
EST  sequences,  all  Plasmodium  falciparum  protein  se¬ 
quences  deposited  on  NCBI,  the  nucleotide  sequences  for  the 
genome  and  CDS  of  P.  yoelii  found  for  public  download  at 
ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/p_yoelii/,  and 
the  stage  specific  cDNA  sequences  from  the  blood-  and 
mosquito  salivary  gland-stages  of  P.  yoelii  [8]  The  results 
of  these  Blast  matches  were  directed  to  a  hyperlinked  Ex¬ 
cel  spreadsheet  that  facilitated  the  task  of  manually  anno¬ 
tating  the  database.  In  order  to  compare  gene  expression  of 
P.  yoelii  in  the  three  stage  specific  libraries,  the  blood  and 
salivary  gland  databases  were  submitted  to  the  same  proce¬ 
dures  (vector  trimming  and  clustering)  as  done  for  the  liver 
stage  library,  and  the  resulting  contigs  of  each  library  blasted 
against  the  others  with  a  cutoff  of  IE-15.  Blast  searches  were 
done  locally  from  executables  obtained  at  the  NCBI  FTP 
site  (ftp://ftp.ncbi.nih.gov/blast/executables/).  Using  similar 
methodologies  to  those  described  above,  an  analysis  was 
done  to  identify  P.  falciparum  homologs  in  the  P.  yoelii  EST 
library.  As  in  the  initial  Blast  analysis,  a  cutoff  of  IE-15 
was  used. 

2.5.  RT-PCR  analysis 

RNA  for  use  in  RT-PCR  analysis,  was  isolated  from  P. 
yoelii  sporozoites,  infected  liver  and  infected  blood  as  previ¬ 
ously  described  [16].  First-strand  cDNA  was  generated  from 
total  RNA  using  the  Superscript  First-Strand  Synthesis  Sys¬ 
tem  for  RT-PCR  kit  (Life  Technologies,  Gaithersburg,  MD). 
The  synthesis  of  cDNA  was  performed  by  priming  RNA, 
isolated  from  the  different  parasite  stages  with  either  random 
hexamers  or  oligo-dt,  and  then  incubation  with  reverse  tran¬ 
scriptase  (RT+).  As  a  control  for  the  presence  of  genomic 
DNA,  reactions  were  done  omitting  the  reverse  transcriptase 


(RT— ).  Amplification  of  specific  gene  sequences  was  accom¬ 
plished  by  PCR  using  a  hot  start  Taq  DNA  polymerase  from 
the  FlotStarTaq  PCR  kit  (Quiagen,  Valencia,  CA).  One  micro¬ 
liter  from  the  cDNA  reaction  was  added  to  a  PCR  master  mix 
with  one  of  several  oligonucleotide  primer  pairs  (Table  S6). 
The  primers  were  designed  and  the  specific  conditions  used 
for  the  PCR  reactions  were  determined  by  using  Mac  Vector 
sequence  analysis  software  (Accelrys,  San  Diego,  CA).  PCR 
products  were  electrophoresed  on  a  1%  agarose  gel,  stained 
with  ethidium  bromide  and  the  DNA  bands  visualized  with  an 
Alphalmager™  (Alpha  Innotech  Corporation,  San  Leandro, 
CA).  RT-PCR  products  were  cloned  into  the  TA  cloning  vec¬ 
tor  and  sequenced  to  verify  that  they  matched  to  the  targeted 
gene. 

3.  Results 

3.1.  Isolation  of  P.  yoelii  liver  stage  schizonts,  RNA 
extraction  and  CDNA  library  construction 

P.  yoelii  schizonts  were  captured  by  LCM  from  mouse 
liver  tissue  40  h  after  sporozoite  infection  as  previously  re¬ 
ported  [18].  Parasites  were  not  fully  mature  but  could  be  eas¬ 
ily  visualized  by  hematoxylin  and  eosin  staining.  Total  RNA 
was  extracted  from  approximately  1500  LCM  schizonts  and 
a  cDNA  library  constructed  from  1  pig  of  total  RNA. 

Isolated  RNA  was  reverse  transcribed  and  amplified  using 
the  SMART™  kit.  A  portion  of  the  amplified  material  was  as¬ 
sessed  by  agarose  gel  electrophoresis  (Fig.  1),  demonstrating 
the  production  of  significant  product  above  300  bp  that  was 
then  size  fractionated  for  ligation  into  the  ITriplEx2  vector. 

3.2.  EST  library  analysis 

A  total  of  2628  clones  were  sequenced  with  1661 
clones  containing  good  quality  readable  recombinant  inserts. 
The  average  length  of  each  insert  was  413  bp.  All  of  the 
ESTs  have  been  deposited  with  GenBank  (accession  nos. 
CK573044-CK574949). 

The  1661  EST  sequences  were  grouped  into  1345  clusters, 
based  upon  ESTs  having  overlapping  sequence.  These  clus¬ 
ters,  containing  one  or  more  overlapping  sequences,  were 
then  assembled  into  contigs.  Only  1%  of  the  EST  clusters 
matched  mouse  sequence,  while  91%  matched  the  P.  yoelii 
genome  and  72.4%  matched  the  predicted  CDS  of  P.  yoelii 
(Table  1).  From  the  contigs,  623  non-redundant  genes  were 
identified.  The  difference  between  the  number  of  matches  for 
the  genome  and  predicted  genes  may  be  due  to  ESTs  from 
UTRs  or,  less  likely,  ESTs  from  unidentified  genes.  The  ma¬ 
jority  of  the  sequences  that  had  no  match  to  the  database, 
were  of  relatively  low  complexity  and  had  GC  content  per¬ 
centages  that  were  50%  or  more.  The  AT  content  of  P.  yoelii 
genes  is  generally  greater  than  70%,  suggesting  that  these  low 
complexity  sequences  were  not  derived  from  parasite  genes. 
Curiously,  a  small  number,  six  sequences,  of  the  ESTs  that 
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Fig.  1.  Agarose  gel  analysis  of  linearly  amplified  cDNA.  RNA,  isolated 
from  40  h  liver  schizonts  was  reverse  transcribed  and  then  amplified  using 
the  SMART™  cDNA  Synthesis  kit  (Clontech.  Palo  Alto,  CA).  Lane  1, 
amplified  cDNA  from  LCM  isolated  Plasmodium  yoelii  schizonts.  Lane  2, 
molecular  weight  markers. 


did  not  have  significant  homology  to  P.  yoelii  genomic  se¬ 
quence  did  have  high  homology  to  P.  falciparum  genomic 
sequences.  Internal  primers  were  constructed  from  three  of 
the  ESTs  (Py  EST  248,  Py  EST  516  and  Py  EST  892)  and 
used  to  successfully  amplify  mRNA  from  liver  stage  para¬ 
sites.  These  sequences  appear  to  represent  genes  that  were 
missed  in  the  initial  annotation  of  the  P.  yoelii  genome  shot¬ 
gun  sequencing  project  [9]. 

To  define  the  differential  expression  of  P.  yoelii  genes,  the 
EST  sequences  obtained  from  mosquito  salivary  gland,  liver, 
and  blood  stage  cDNA  libraries  were  Blast  analyzed  against 
the  predicted  CDS  of  P.  yoelii  with  aBlastN  cutoff  of  IE  —  12, 
thus  identifying  genes  expressed  in  all  three  libraries,  or  in 


Table  1 

Distribution  of  P.  yoelii  sequences  and  clusters 


Database 

Number  of  ESTs  (%) 

Number  of  clusters  (%) 

AT  (%) 

Mouse 

19(1.1) 

16(1.1) 

55.6 

lRNA 

41  (2.4) 

12  (0.9) 

60.9 

Py  Genome 

1508  (90.8) 

1226  (91.1) 

70.7 

Py  CDS 

1269  (76.3) 

975  (72.4) 

70.4 

Pf  proteins3 

6  (0.3) 

6  (0.4) 

78.2 

No  matchb 

87  (5) 

85  (6) 

51.9 

11  ESTs  that  did  not  match  sequence  in  the  P.  yoelii  genomic  database,  but 
did  match  to  the  P.  falciparum  proteome  (by  Blastx),  with  a  cutoff  of  le  —  15. 

b  ESTs  that  did  not  demonstrate  a  significant  match  to  sequence  in  any 
database. 


any  other  combination.  The  complete  tables  listing  the  stage 
distribution  of  the  ESTs,  and  their  corresponding  genes,  are  in 
Tables  S4-S7  found  in  supporting  information  online.  These 
tables  have  been  hyperlinked  to  appropriate  databases  to  al¬ 
low  efficient  utilization  of  the  EST  library  to  identify  genes 
or  proteins. 

This  approach  of  comparing  of  identified  genes  was  pre¬ 
ferred  rather  than  comparing  the  EST  sequences  to  each  other, 
because  multiple  ESTs  could  map  to  different  regions  of  the 
same  gene,  but  show  no  match  to  each  other.  This  could  po¬ 
tentially  bias  the  analysis  and  make  stage  specific  compar¬ 
isons  misleading.  Our  analysis  also  eliminated  problems  that 
could  derive  from  differences  in  library  construction,  such  as 
biasing  or  not  for  5’  regions  of  mRNA.  The  distribution  of 
the  identified  genes  among  the  different  developmental  stages 
was  consistent  with  the  time  of  development  at  which  the  liver 
parasites  were  harvested.  A  very  small  percentage  (1.9%)  of 
the  total  number  of  genes  were  found  to  match  expressed 
genes  previously  found  uniquely  in  the  sporozoite  library, 
while  a  higher  percentage  (16%)  had  significant  matches  to 
both  sporozoite  and  blood  stage  sequences.  As  might  be  ex¬ 
pected,  based  upon  the  time  in  development  when  the  para¬ 
sites  were  harvested,  a  large  percentage  of  the  genes  (56.6%) 
had  matches  to  blood  stage  genes  identified  by  the  blood 
stage  EST  library  (Table  2).  The  most  significant  finding, 
however,  was  that  25%  of  the  genes  identified  by  the  liver 
stage  ESTs,  did  not  match  either  sporozoite  or  blood  stage 
expressed  genes.  This  suggested  that  a  considerable  fraction 
of  these  ESTs,  from  our  library,  were  liver  stage  specific 
and  represented  message  for  proteins  that  were  uniquely  ex¬ 
pressed  during  this  stage  of  development.  This  finding  that 
a  significant  fraction  of  the  library  represents  potential  stage 
specific  expression  is  similar  to  the  results  seen  in  a  proteomic 
analysis  of  multiple  P.  falciparum  stages  [26].  Those  results 
identified  between  20  and  33%  of  the  blood  stage  proteins  as 
being  specific  to  trophozoites,  merozoites  or  gametocytes. 

3.3.  RT-PCR  analysis 

RT-PCR  analysis  was  done  to  confirm  that  the  expression 
patterns  identified  by  comparing  the  EST  libraries,  from  the 


Table  2 

Genes  identified  from  the  P.  yoelii  liver  stage  library  demonstrating  signifi¬ 
cant  matches  to  genes  identified  by  Blast  analysis  of  P.  yoelii  sporozoite  or 
blood  stage  EST  libraries  (based  on  BlastN  cutoff  of  le  —  12) 


Database 

Number  of 

matches 

Total  (%) 

Sporozoite  only 

12a 

1.9b 

Blood  stage  only 

353 

56.7 

Sporozoite  and  blood  stage 

100 

16 

Not  matching  either  sporozoite  or  blood 
stage,  bur  matching  predicted  CDS 

158 

25.3 

Total 

623 

100 

3  Number  of  non-redundant  predicted  genes. 

b  Percent  of  non-redundant  genes  identified  in  a  specific  subset  in  relation 
to  all  of  the  non-redundant  genes  identified  by  the  liver  stage  cDNA  library. 
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Fig.  2.  Representative  RT-PCR  analysis  of  P.  yoelii  gene  sequences  present 
in  the  in  vivo  liver  stage  EST  library.  RT-PCR  analysis  was  done  using 
primers  for  genes  identified  from  the  Blast  searches  as  being  expressed 
during  sporozoite  and  liver  stages  (PY03684),  liver  stage  only  (PY04760), 
liver  and  blood  stages  (PY04760)  and  sporozoite,  liver  and  blood  stages 
(PY05916).  Total  RNA  from  infected  liver  (LV),  sporozoites  (SPZ)  and  in¬ 
fected  red  blood  cells  (RBC)  was  subjected  to  first  strand  synthesis  with 
(+)  or  without  (— )  reverse  transcriptase.  The  cDNA  was  then  used  as  the 
template  for  PCR  with  the  gene  specific  primers. 


different  stages  of  the  parasite,  were  accurate.  As  it  would 
have  been  impractical  to  analyze  all  of  the  genes,  a  small 
group  of  genes  were  selected  for  amplification  from  the  dif¬ 
ferent  expression  groups  (sporozoite  and  liver;  liver  only, 
liver,  blood  stage  and  sporozoite;  liver  and  blood  stage). 
Genes  with  putative  identifications  as  well  as  genes  that 
coded  for  proteins  with  unknown  function  were  included 
(Table  S9).  Additionally,  some  of  the  genes  were  chosen 
because  they  contained  putative  signal  sequences,  transmem¬ 
brane  domains  or  represented  potential  virulence  factors.  The 
primer  sequences  that  were  designed  to  amplify  the  3'  regions 
of  the  selected  transcripts  are  detailed  in  Table  S 10.  Represen¬ 
tative  results  for  the  RT-PCR  analysis  are  presented  in  Fig.  2. 
In  all  cases,  controls  were  done  in  which  first  strand  synthesis 
did  not  contain  reverse  transcriptase.  These  were  uniformly 
negative,  demonstrating  that  the  PCR  products  were  the  result 
of  amplification  from  cDNA.  Additionally,  the  PCR  prod¬ 
ucts  were  cloned  and  sequenced  to  verify  that  the  products 
matched  the  gene  being  assessed. 

Expression  patterns  in  the  different  stages  were,  for  the 
most  part,  consistent  with  those  predicted  from  the  database 
(Table  3).  In  several  cases,  however,  genes  that  were  predicted 
to  have  expression  in  specific  stages  did  not  produce  RT-PCR 
products  or  were  RT-PCR  positive  when  there  was  no  predic- 


Table  3 

RT-PCR  analysis  of  P.  yoelii  predicted  gene  expression 


Gene  locusa 

RT-PCR 

Sporozoite 

Liver  stage 

Blood  stage 

SPZ-LVb 

PY00150 

+ 

+ 

PY03269 

+ 

+ 

PY04496 

+ 

+ 

PY04658 

+ 

+ 

LV 

PY01013 

+ 

PY01200 

+ 

PY04760 

+ 

PY05129 

+ 

PY05811 

+ 

PY06882 

+ 

+ 

LV-BLD 

PY00516 

+ 

PY00487 

+ 

+ 

PY00972 

+ 

+ 

PY03267 

+ 

+ 

PY04945 

+ 

+ 

PY07320 

+ 

+ 

SPZ  LV  BIT) 

PY00479 

+ 

+ 

PY01412 

+ 

+ 

PY03207 

+ 

+ 

PY05916 

+ 

+ 

+ 

11  The  Institute  for  Genome  Research  Locus  identifier  for  predicted  P.  yoelii 
coding  regions. 

b  Stage  specific  expression  of  putative  genes  as  identified  from  EST 
database  analysis  of  sporozoite  (SPZ),  liver  (LV)  and  blood  (BLD). 

tion  for  gene  expression.  It  is  possible  that  these  transcripts 
were  of  low  abundance  in  the  sporozoites  or  blood  stages, 
requiring  higher  amounts  of  cDNA  template  to  produce  a 
positive  PCR  product  and  in  the  case  of  positives  where  no 
product  was  predicted,  it  is  certainly  possible  that  expres¬ 
sion  was  not  represented  in  the  EST  libraries.  Of  interest  was 
the  liver  stage  specific  expression  of  the  six  genes  chosen 
from  the  database  for  RT-PCR  validation.  These  genes  were 
predicted  to  be  expressed  in  only  the  liver  stage  and  it  was 
encouraging  to  find  that  5  of  6  produced  a  positive  RT-PCR 
product  from  liver  stage  material. 

3.4.  Categorization  of  putative  function 

The  non-redundant  protein  database  was  searched  by 
Blastx  analysis,  and  possible  functions  for  the  P.  yoelii  liver 
stage  clusters  were  assigned  (Fig.  3).  As  described  in  previ¬ 
ous  reports  of  functional  classifications  for  other  stages  of  the 
parasite’s  lifecycle  [6,7],  a  majority  of  the  ESTs  (58%)  repre¬ 
sented  proteins  with  an  unknown  function.  Genes  involved  in 
protein  synthesis  and  nuclear  regulation,  while  representing 
a  relatively  small  percentage  of  the  total,  were  the  next  most 
common  groups.  This  was  not  surprising,  as  development  is 
very  rapid  and  the  processes  that  support  the  rapid  growth  of 
the  parasite  would  be  significantly  represented.  These  find¬ 
ings  were  in  contrast  to  the  results  obtained  from  a  P.  yoelii 
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Functional  Classification  of  P.  Yoelii  Liver  Stage  ESTs 

Cytoskeletal  (1 .0%) 
Transporters  (1 .4%) 

Protein  Processing  (1%) 

Proteosome  proteins  (1 .5%) 

Chaperones  (2.4%]^ 

Signal  Transduction  (3.7%^ 

Virulence  Factors  {5.4%)/ 


Metabolism  (5.9%) 
Nuclear  Regulation  (8.9%) 


I  Unknown  Function  (57.5%) 


Protein  Synthesis  (1 1 .2%) 


Fig.  3.  Functional  profile  of  P.  yoelii  genes  identified  by  liver  stage  EST 
library.  The  results  of  a  Blast  analysis  of  the  P.  yoelii  liver  stage  EST  library 
identified  632  genes  that  were  expressed  during  development  in  the  hepato- 
cyte.  Further  Blastx  evaluation  assigned  a  putative  biological  function  to  the 
matches  and  the  genes  were  then  clustered  and  are  presented  as  a  percentage 
of  the  total.  The  specific  ascribed  function  for  an  expressed  gene  can  be 
found  in  the  supplemental  tables. 

sporozoite  library,  in  which  it  was  found  that  ribosomal  pro¬ 
teins  and  other  proteins  involved  in  protein  synthesis  were 
underrepresented  [8].  This  is  not  unexpected,  as  sporozoites 
are  developmentally  mature  and  would  not  have  nearly  the 
same  level  of  transcriptional  and  translational  activity  as  par¬ 
asites  in  a  developing  schizont.  Chaperones  and  heat  shock 
proteins  made  up  a  relatively  small  portion  of  the  ESTs. 

3.5.  Identification  of  orthologous  P.  falciparum  genes 

Blastx  analysis,  of  the  EST  library,  was  done  to  identify 
potential  orthologous  P.  falciparum  genes.  Using  a  cut  off 
value  of  le  —  15,  438  EST  clusters  were  found  to  have  sig¬ 
nificant  matches  to  P.  falciparum  transcripts.  A  spreadsheet 
in  the  supporting  online  material  (Table  S8)  identifies  the  P. 
yoelii  gene  locus  and  the  corresponding  homologous  P.  fal¬ 
ciparum  locus.  As  might  be  expected,  there  were  a  number 
of  genes  involved  in  DNA  replication  and  protein  synthesis 
(ribosomal  proteins  predominantly)  as  well  as  other  house¬ 
keeping  functions  that  shared  a  high  degree  of  homology 
between  the  two  species.  As  seen  in  assessing  the  putative 
function  of  the  P.  yoelii  genes,  almost  half  of  the  identified 
orthologues  coded  for  hypothetical  proteins. 


4.  Discussion 

Infected  hepatocytes  are  an  important  target  of  a  protec¬ 
tive  immune  response  during  a  P.  yoelii  infection  [27-29]. 
This  finding  makes  the  identification  and  characterization 
of  plasmodial  antigens  expressed  in  the  infected  hepatocyte 
and  the  immune  responses  against  these  antigens  crucial  for 
the  development  of  a  pre-erythrocytic  stage  malaria  vaccine. 
Additionally,  understanding  the  biology  associated  with  the 
development  of  the  parasite  in  the  hepatocyte  will  not  only 
further  our  understanding  of  this  part  of  the  lifecycle,  but  may 
provide  insights  into  the  development  of  effective  chemother¬ 
apeutic  s.  Our  initial  identification  of  in  vivo  liver  stage  gene 


expression  has  demonstrated  that  this  stage  of  the  parasites’ 
lifecycle  is  amenable  to  molecular  analysis  and  that  the  in 
vivo  differential  expression  of  the  plasmodial  liver  stage  tran- 
scriptome  can  be  assessed. 

Work  published  recently  by  Wang  et  al.  [17]  used  an  in 
vitro  manipulation  of  P.  yoelii  sporozoites  to  produce  early 
exo-erythrocytic  stage  parasites  from  which  a  cDNA  library 
was  generated.  While  there  were  some  similarities  between 
that  library  and  our  EST  library,  there  were  also  substantial 
differences  that  were  most  likely  related  to  the  parasite  stage 
from  which  RNA  was  extracted.  A  Blast  analysis  compar¬ 
ing  the  two  libraries  found  98  genes  that  were  common  to 
both  libraries  and  also  found  in  the  available  databases  for  P. 
yoelii  sporozoites  and  blood  stages.  Additionally,  ESTs  from 
both  libraries  identified  2 1  genes  that  were  not  found  in  any 
of  the  P.  yoelii  expression  databases  (putatively  liver  stage 
specific),  but  were  common  to  our  library  and  the  Wang  li¬ 
brary.  As  reported  by  Wang,  we  also  detected  expression  of 
message  for  the  P.  yoelii  PfEMP3-like  molecules,  merozoite 
surface  protein- 1  (MSP-1)  and  ring-infected  erythrocyte  sur¬ 
face  antigen  (RES  A),  but  unlike  Wang  found  no  ESTs  in  our 
library  for  the  circumsporozoite  protein  (CS)  and  sporozoite 
surface  protein  2  (SSP2).  This  is  consistent  with  our  previ¬ 
ous  work,  in  which  we  were  unable  to  amplify  message  for 
CS  from  LCM  liver  stage  schizonts  [18].  Additionally,  we 
had  representation  of  PyHEP17  message,  an  antigen  that  is 
the  P.  yoelii  orthologue  of  P.  falciparum  exported  protein- 1 
(EXP-1).  PyHEP17  is  expressed  very  early  during  liver  stage 
development  and  subsequently  into  the  blood  stage.  We  also 
found  some  expression  of  message  for  heat  shock  proteins 
and  chaperones,  but  they  represented  only  a  little  over  2% 
of  the  ESTs.  The  presence  of  ESTs  that  represented  a  broad 
range  of  gene  expression  and  the  lack  of  many  sporozoite 
transcriptional  elements  suggested  that  this  library  was  rep¬ 
resentative  of  the  maturing  in  vivo  liver  stage  transcriptome. 

While  the  EST  library  described  in  this  work  was  made 
from  relatively  late  stage  parasites  (~40  h)  we  have  recently 
been  able  to  microdissect  parasites  from  as  early  as  30  h  after 
infection  (data  not  shown)  by  using  an  immunofluorescent 
staining  protocol  to  visualize  the  early  developing  schizonts. 
The  comparison  of  transcription  at  this  earlier  stage  with  the 
more  mature  parasites  may  provide  insights  into  the  changes 
that  occur  in  gene  expression  as  the  parasite  goes  through  the 
transitional  liver  stage. 

The  most  interesting  and  difficult  aspect  of  this  analysis 
has  been  the  large  number  of  genes  identified  that  code  for 
proteins  with  no  known  function.  These  genes  are  of  interest, 
as  they  do  not  appear  in  any  database  and  represent  poten¬ 
tially  unique  targets.  The  recent  publication  of  methodologies 
to  rapidly  test  hundreds  of  unknown  genes  [30,31]  will  likely 
provide  information  about  these  proteins  with  unknown  func¬ 
tions  and  aid  in  the  selection  of  drug  and  vaccine  targets.  A 
comprehensive  analysis,  of  these  genes,  is  beyond  the  scope 
of  this  initial  work  and  will  require  extensive  future  studies.  It 
is  clear  that  there  is  a  tremendous  potential  for  thorough  gene 
analysis  at  the  liver  stage  and  it  should  now  be  possible  to 
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utilize  the  genomic  data  that  has  been  made  available  through 
genome  sequencing  projects  to  accomplish  this  aim. 
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